SEQUENCE LISTING 



DMASIS- DNA Translation [T-INS] 
File Name : T-INS 

Range :. 2 - 728^ Mode : Normal 

Codon. Table : Universal 

' 3 ' 10 19\ 28 37 45 55 

GAG CTG AGC TGA ACT GGC \lCT CCT GGG GAC TCA GCA AGC TCT CTA GAG CCC CCC 



64 73 
ACA TGC TCC CCC ACC GGG 



82 91 100 109 

CCC CGG TTG CGT GAG GAC ACC TCC TCT GAG GGG 



118 127 
CTC CGC TCG CCC CTC TTC GGA 



136 145 154 163 

CCC GGG GCC CCG GCT GGC CAG AGG IATG GAC 



172 181 
GAG GAG GAG GAT GGA GCG GGC GCC 



V 




• 



J.90 199 208 217 

MS GAG TCG GGA CAG CCC CGT AGC TTC ACG 



226 235 244' 

CAG CTC AAC GAC CTG TCC GGG GCC GGG 



253 262 271 

CGG CAG GGG CCG GGG TCG ACG GAA 



Q L N D L 

280 289 298 \ 307 315 325 

AAG GAC CCG GGC AGC GCG GAC TCC GAG Gc\ GAG GGG CTG CCG TAC CCG GCG CTA 

K D P G S 



334 343 352 

GCC CCG GTG GTT TTC TTC TAC TTG AGC CAG 



361 370 379 

iC AGC CGC CCG CGG AGC TGG TGT 



388 397 406 V415 424 433 

CTC CGC ACG GTC TGT AAC CCG TGG TTC GAG CGA QTC AGT ATG CTG GTC ATT CTT 



V 



442 451 460 469\ 

CTC AAC TGT GTG ACT CTG GGT ATG TTC AGG CCG TGT 



L V I L 

478 487 
lG GAC ATT GCC TGT GAC 



L N 


C V 


T 


L 


G 


M 


F 


R 


P 


C 




D 


I 


A 


C 


D 


TCC CAG 


496 

CGC TGC 


CGG 


505 
ATC 


CTG 


CAG 


514 
GCC 


TTC 


GAT 


523 
GAC 




ATC 


532 
TTT 


GCC 


TTC 


541 
TTT 


S Q 


R C 


R 


I 


L 


Q 


A 


F 


D 


D 


F 


\ I 


F 


A 


F 


F 


GCT GTG 


550 

GAA ATG 


GTG 


559 
GTG 


AAG 


ATG 


568 
GTG 


GCC 


TTG 


577 
GGC 


ATC 




586 
GGG 


AAG 


AAA 


595 
TGT 


A V 


E M 


V 


V 


K 


M 


V 


A 


h 


G 


I 


f\ 


G 


K 


K 


C 


604 

TAC "CTG GGA GAC 


ACT 


613 
TGG 


AAC 


CGG 


622 
CTT 


GAC 


TTT 


631 
TTC 


ATT 


GTC 


«40 


GCA 


GGG 


649 
ATG 


Y L 


G D 


T 


W 


N 


R 


L 


D 


F 


F 


I 


V 


\ 




G 


M 


D0210\l 

































56° 



( 



2 



DNASXS- DNA TxcmsU.atIon [T-INS] 



658 

CTG GAG TAT TCG 



667 676 685 694 703 

GAC CTG CAG AAC GTC AGC TTC TCC GCA GTC AGG ACA GTC 



N 



V 



V 



712 V21 730 739 748 757 

CGT GTG CTG CGA CCG QTC AGG GCC ATT AAC CGG GTG CCC AGC ATG CGC ATT CTC 



V 



N 



M 



766 775\ 784 793 802 811 

GTC ACA TTA CTG CTG GAC V.CC TTG CCT ATG CTG GGC AAC GTC CTG CTG CTC TGT 



V T L L, L D 

820 829 
TTC TTC GTC TTT TTC ATC 



M 



N V 



838 847 855 865 

, GGC ATC GTG GGC GTC CAG CTG TGG GCA GGA CTG 



V 



V 



V 



W 



874 883 \ 892 901 910 919 

CTT CGC AAC CGA TGC TTC CTC CCC GAG AAC TTC AGC CTC CCC CTG AGC GTG GAC 



N 



928 937 . 

CTG GAG CCT TAT TAC CAG ACA GAG 



N 



V 



955 964 973 

GAG GAC GAG AGC CCC TTC ATC TGC TCT 



982 991 1000 

CAG CCT CGG GAG AAT GGC ATG AGA TCC 



1009 1018 1027 

AGG ACT GTG CCC ACA CTG CGT GGG 



N 



M 



1036 1045 1054 \ 1063 1072 1081 

GAA GGC GGT GGT GGC CCA CCC TGC AGT CTG\GAC TAT GAG ACC TAT AAC AGT TCC 

Y E T Y N S S 

1090 1099 1108 \lll7 ■ 1126 1135 

AGC AAC ACC ACC TGT GTC AAC TGG AAC CAG TAQ TAT ACC AAC TGC TCT GCG GGC 




N 



N W 



N 



1144 . 1153 1162 11V1 1180 1189 

GAG CAC AAC CCC TTC AAA GGC GCC ATC AAC TTT GSC AAC ATT GGC TAT GCC TGG 



N 



K 



N 



1198 1207 1216 1225 

ATC GCC ATC TTC CAG GTC ATC ACA CTG GAG GGC TGG 



N I G Y A W 

1234 1243 
GAC ATC ATG TAC TTC 



W 



M 



1252 1261 1270 1279 \ 1288 1297 

GTA ATG GAC GCT CAC TCC TTC TAC AAC TTC ATC TAC TTC VtT CTT CTC ATC ATC 



V M 



N 



1306 1315 1324 1333 \1342 1351 

GTG--GGC TCC TTC TTC ATG ATC AAC CTG TGC CTG GTG GTG AtA GCC ACG CAG TTC 



V 



M 



N 



V 



H:\004\00210\l 



3 



DNASIS DNA Translation [T-XNS] 

1360 \ 1369 1378 1387 1396 1405 

TCC GAG ACC AAA CAG CGG GAG AGT CAG CTG ATG CGG GAG CAG CGT GTA CGA TTC 



K 



M 



1414 U423 1432 1441 1450 1459 

CTG TCC AAT GCT AGC\ACC CTG GCA AGC TTC TCT GAG CCA GGC AGC TGC TAT GAG 

L S N A 

1468 147^ 1486 1495 1504 1513 

GAG CTA CTC AAG TAC CTG\ GTG TAC ATC CTC CGA AAA GCA GCC CGA AGG CTG GCC 



E L L K Y 

1522 1531 
CAG GTC TCT AGG GCT ATA 



K 



1540 1549 1558 1567 

GTG CGG GCT GGG CTG CTC AGC AGC CCA GTG GCC 



V R 

1576 1585 \ 1594 1603 1612 1621 

CGT AGT GGG CAG GAG CCC CAG OCC AGT GGC AGC TGC ACT CGC TCA CAC CGT CGT 

5GSCTRSHRR 

1630 1639 1648 1657 1666 1675 

CTG TCT GTC CAC CAC CTG GTC CAC CAC CAT CAC CAC CAC CAT CAC CAC TAC CAC 



V 



H H 



H H H H 



1684 1693 1702\ 1711 1720 1729 

CTG GGT AAT GGG ACG CTC AGA GTT CCCXCGG GCC AGC CCA GAG ATC CAG GAC AGG 



N 



V 



1738 1747 1756 \ 1765 1774 1783 

GAT GCC AAT GGG TCT CGC CGG CTC ATG CrAcCA CCA CCC TCT ACA CCC ACT CCC 



R 



M 



1792 1801 1810 \1819 1828 1837 

TCT GGG GGC CCT CCG. AGG GGT GCG GAG TCT GTA CAC AGC TTC TAC CAT GCT GAC 



, H 

1846 1855 1864 18V3 1882 1891 

TGC CAC TTG GAG CCA GTC CGT TGC CAG GCA CCC CCT CCC AGA TGC CCA TCG GAG 



1900 1909 1918 1927 \ 1936 1945 

GCA TCT GGT AGG ACT GTG GGT AGT GGG AAG GTG TAC CQC ACT GTG CAT ACC AGC 



V 



K V 



1954 1963 1972 1981 

CCT CCA CCA GAG ATA CTG AAG GAT AAA GCA CTA GTG GAG 



T V H T S 

1990 1999 
GCC CCC AGC CCT 



PE ILKDKALVE 

2008 " 2017 2026 2035 1044 2053 

GGG'-CCC CCC ACC CTC ACC AGC TTC AAC ATC CCA CCT GGG CCC "JTC AGC TCC ATG 



N 



H:\004\00210\l 



H 



DNAsrs DNA Translation [T-INS] 



2062 
CAG AAG CTC CTG 



2071 2080 2089 2098 2107 

ACA CAG AGT ACG GGA GCC TGC CAT AGC TCC TGC AAA ATC 




H K 



K 



2115 \2125 2134 2143 2152 2161 

TCC AGC CCT TGC TCC\AAG GCA GAC AGT GGA GCC TGC GGG CCG GAC AGT TGT CCC 



2170 217\ 2188 2197 2206 2215 

TAC TGT GCC CGG ACA GgA GCA GGA GAG CCA GAG TCC GCT GAC CAT GTC ATG CCT 



2224 2233 
GAC TCA GAC AGC GAG GCT 



M 



2242 2251 2260 2269 

TAT GAG TTC ACA CAG GAC GCT CAG CAC AGT GAC 



2278 2287 \ 2296 2305 2314 2323 

CTC CGG GAT CCC CAC AGC CGG CGG CGA CAG CGG AGC CTG GGC CCA GAT GCA GAG 



2332 2341 
CCT AGT TCT GTG CTG GCT TTC TGG 



R Q 
50 



2359 2368 2377 

CTG ATC TGT GAC ACA TTC CGG AAG ATC 



2386 2395 2404 \ 2413 2422 2431 

GTA GAT AGC AAA TAC TTT GGC CGG GGA ATC ATG ATC GCC ATC CTG GTC AAT ACA 



V 



N 



2440 2449 2458 \ 2467 2476 2485 

CTC AGC ATG GGC ATC GAG TAC CAC GAG CAG Of C GAG GAG CTC ACC AAC GCC CTG 

L S M G 

2494 2503 2512 2521 2530 2539 

GAA ATC AGC AAC ATC GTC TTC ACC AGC CTC TTC GCC TTG GAG ATG CTG CTG AAA 



N 



K 



2548 2557 2566 2575 \ 2584 2593 

CTG CTT GTC TAC GGT CCC TTT GGC TAC ATT AAG AAT flfCC TAC AAC ATC TTT GAT 

L L V Y 

2602 2611 2620 2629 \ 2638 2647 

GGT GTC ATT GTG GTC ATC AGT GTG TGG GAG ATT GTG GGc\cAG CAG GGA GGT GGC 

G V I V 

2656 2665 2674 2683 \2692 2701 

CTG TCG GTG CTG CGG ACC TTC CGC CTG ATG CGG GTG CTG AAG, CTG GTG CGC TTC 



2710 2719 2728 2737 2746 2755 

CTG CCG GCC CTG CAG CGC CAG CTC GTG GTG CTC ATG AAG ACC AT$ GAC AAC GTG 

N V 
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DNASIS DNA Translation [T-INS] 

2764 \ 2773 2782 2791 2800 2809 

GCC ACC TTC TGC ATG CTC CTC ATG CTG TTC ATC TTC ATC TTC AGC ATC CTG GGC 

- LMLFIFIFSI L. G 

2818 V2827 2836 2845 2854 2863 

ATG CAT CTC TTT GGT\ TGC AAG TTC GCA TCT GAA CGG GAT GGG GAC ACG TTG CCA 



M H L F C 

2872 2 
GAC CGG AAG AAT TTC 



,C K 
31 



2890 
TCC CTG CTC 



2899 
TGG GCC ATC 



2908 
GTC ACT GTC 



T L P 

2917 
TTT CAG ATT 



K 



N 



2926 2935\ 2944 

CTG ACT. CAG GAA GAC TGGIAAT AAA GTC 



2953 
CTC TAC AAC 



V T V 

2962 
GGC ATG GCC 



F Q I 

2971 
TCC ACA TCG 



L T Q E D W 



N 



M 



S T S 



2980 
TCT TGG GCT 



2989 
GCT CTT TAC 



2998 
ATC GCC 



3007 
CTC ATG ACT 



3016 
TTT GGC AAC 



3025 
TAT GTG CTC 



S W A A L Y 



3034 
TTT AAC CTG 



3043 
CTG GTG GCC 



3052 
: GTG 



ATT 
I L\ V 
106 

GGA CAG 1 
G Q 

31« 

AAG TCT GA<3 



3061 
GAA GGA TTC 



F G N 

3070 
CAG GCA GAG 



Y V L 

3079 
GAA ATC GGC 



F N L 

3088 
AAA CGG GAA 



L V A 

3097 
GAT GCG AGT 



Q A E E I G 



3115 
AGC TGT ATT 



3124 
CAG CTG CCT 



3133 
GTC AAC TCT 



K R E 

3142 
CAG GGG GGA 



DAS 

3151 
GAT GCC ACC 



' 3169 
TCA GAG CCT 



Q L P 

3178 
GAT TTC TTT 



V N S 

3187 
TCG CCC AGT 



Q G G DAT 



3196 
GTG GAT GGT 



3205 
GAT GGG GAC 



K S I 

3214 
AGA AAG AAG 



3223 
C<SC TTG GCC 



3232 
CTG GTG GCT 



S P S 

3241 
TTG GGA GAA 



V D G 

3250 
CAC GCG GAA 



D G D 

3259 
CTA CGA AAG 



R K K 

3268 
AGC CTT TTG 



3277 
CCA (5CC CTC 



3286 
ATC ATC CAT 



L G E 

3295 
ACG GCT GCG 



H A E 

3304 
ACA CCA ATG 



L R K 

3313 
TCA CTA CCC 



3322 
AAG AGC TCC 



T P M S L P 



3358 
TCT GGC TCT 



3367 
CGA CGT ACC 



3376 
AGT AGC AGT 



AGC ACA 
S T 

33E 

GGG TCC GO] 



3340 
GTG GGG GAA 



T A A 

3349 
GCA CTG GGC 



3394 
GAG CCT GGA 



A L G 

3403 
GCT GCC CAC 



S G S 

3412 
CAT* GAG ATG 



R R T 

3421 
AAA TCT CCG 



3430 
CCA AGT GCC 



3439 
CGC AGC TCC 



3448 
CG CAC AGT 



A A H 

3457 
CCC TGG AGT 



H E M K S P 



R S 



P W S 
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DNASIS DHA Translation [T-INS] 

346S V 3475 3484 3493 3502 3511 

GCG GCA AGC AGC T&G ACC AGC AGG CGC TCC AGC AGG AAC AGC CTG GGC CGG GCC 

A A S SW\TSRRSSRNSLGRA 

3520 *529 3538 3547 3556 3565 

CCC AGC CTA AAG CGg\aGG AGC CCG AGC GGG GAG CGG AGG TCC CTG CTG TCT GGA 

pslkrVrs PSGERRSLLSG 

3574 3533 3592 3601 3610 3619 

GAG GGC CAG GAG AGT CAG GAT GAG GAG GAA AGT TCA GAA GAG GAC CGG GCC AGC 

EG.QESQ\DEEESSEEDRAS 

3628 3637 \ 3646 3655 3664 3673 

CCA GCA GGC AGT GAC CAT GGC CAC AGG GGT TCC TTG GAA CGT GAG GCC AAG AGT 

"p A G~ SDH ftHRGSLEREAKS 

3682 3691 \ 3700 3709 3718 3727 

TCC TTT GAC CTG CCT GAC ACT\CTG CAG GTG CCG GGG CTG CAC CGC ACA GCC AGC 

S F D L P D T \L Q V P GL HRT A S 

3736 3745 \ 3754 3763 3772 3781 
GGC CGG AGC TCT GCC TCT GAG CAC CAA GAC TGT AAT GGC AAG TCG GCT TCA GGG 
\ 

GRS SASEh\qDCNGKSASG 

3790 3799 3808 3817 3826 3835 

CGT TTG GCC CGC ACC CTG AGG ACT GAT GAC CCC CAA CTG GAT GGG GAT GAT GAC 

RL ARTLRTbDPQLDGDDD 

3844 3853 3862 3871 3880 3889 

AAT GAT GAG GGA AAT CTG AGC AAA GGG, GAA CGC ATA CAA GCC TGG GTC AGA TCC 

NDE GNL S K G \E R I QAWVRS 

3898 3907 3916 \ 3925 3934 3943 

CGG CTT CCT GCC TGT TGC CGA GAG CGA GAT TCC TGG TCG GCC TAT ATC TTT CCT 

RLPACCRER D\SWSAY IF P 

3952 3961 3970 \ 3979 3988 3997 

CCT CAG TCA AGG TTT CGT CTC CTG TGT CAC ?GG ATC ATC ACC CAC AAG ATG TTT 

PQSRFRLLC'HAIITHKMF 

4006 4015 4024 Vo33 4042 4051 

GAC CAT GTG GTC CTC GTC ATC ATC TTC CTC AAC\TGT ATC ACC ATC GCT ATG GAG 

DHVVLVI IFLN^CITIAME 

4060 4069 4078 408V 4096 4105 

CGC CCC AAA ATT GAC CCC CAC AGC GCT GAG CGC ATC\ TTC CTG ACC CTC TCC AAC 

RPK I D P H S A E R I \F L T L S N 

4114 ' 4123 4132 .4141 \ 4150 4159 

TAC-ATC TTC ACG GCA GTC TTT CTA GCT GAA ATG ACA GTG AAG GTG GTG GCA CTG 

Y I F TAVF LA E M T V \ K V V A L 



H:\004\00210\l 



1 



DNASIS DNA Translation [T-INS] 



4158 
GGC TGG TGC TTT 



4177 4186 4195 4204 4213 

GAG CAG GCC TAC CTG CGC AGC AGC TGG AAT GTG CTG GAC 



W 



W N 



4222 4231 4240 4249 4258 4267 

GGC TTG CTG GTG CTC ATC TCC GTC ATC GAC ATC CTG GTC TCC ATG GTC TCC GAC 



V 



M V 



4276 4285\ 4294 4303 4312 4321 

AGC GGC ACC AAG ATC CTT\GGC ATG CTG AGG GTG CTG CGG CTG CTG CGG ACC CTG 



K 



M 



V 



4330 4339 \ 4348 4357 4366 4375 

CGT CCA CTC AGG GTC ATC AG? CGG GCC CAG GGA CTG AAG CTG GTG GTA GAG ACT 

R P L R V I 

4384 4393 \ 4402 4411 4420 4429 

CTG ATG TCA TCC CTC AAA CCC ATT GGC AAC ATT GTG GTC ATT TGC TGT GCC TTC 

L M ' S S L K 

4438 4447 \4456 4465 4474 4483 

TTC ATC ATT TTT GGA ATT CTC GGG\GTG CAG CTC TTC AAA GGG AAG TTC TTC GTG 

F F ' V 

4492 4501 4510 4519 4528 4537 

TGT CAG GGT GAG GAC ACC AGG AAC ATE ACT AAC AAA TCC GAC TGC GCT GAG GCC 



4546 4555 4564 \ 4573 4582 4591 

AGC TAC CGA TGG GTC CGG CAC AAG TAC AAC TTT GAC AAC CTG GGC CAG GCT CTG 



W 



V 



4600 4609 4618 

ATG TCC CTG TTT GTG CTG GCC TCC AAG GAT 



4627 4636 4645 

TGG GTT GAC ATC ATG TAT GAT 



M 



V 



4708 4717 4726 472 

ATG CTG CTA TAC TTC ATC TCC TTC CTC CTC ATC 



M 



4762 4771 4780 4789 

ATG TTT GTG GGC GTG GTG GTG GAG AAC TTC CAT AAG 




M 



4654 4663 4672 \4681 4690 4699 

GGG CTG GAT GCT GTG GGT GTG GAT CAG CAG CCCXATC ATG AAC CAC AAC CCC TGG 



N P W 

4744 4753 
GCC TTC TTT GTC CTG AAC 



4798 4807 
AGA CAG CAC CAG GAG 



4816 4825 4834 4843 

GAG .GAG GAG GCG AGG CGG CGT GAG GAG AAG CGA CTA CGG 



4852 4861 
CTG GAG AAA AAG 

E K K 
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DNASIS' DNA Translation. [T-INS] 

• 4870 \ 4879 4888 4897 4906 4915 

AGA AGG AAT CIA ATG TTG GAC GAT GTA ATT GCT TCC GGC AGC TCA GCC AGC GCT 



4924 \4933 4942 

GCG TCA GAA GCC CAGt TGC AAG CCC TAC 



4951 
TAC TCT GAC 



4978 49S7 4996 

CTT GTC CAC CAC CTG T$T ACC AGC CAC 



5005 
TAC CTG GAC 



L V H H 

5032 5041\ 5050 

ATC GGG CTG AAC GTG GTC ACT ATG GCC 



5059 
ATG GAA CAT 



I G L N V 

5086 5095 \ 5104 

CTG GAC GAG GCT CTG AAG ATG TGC AAT 



5113 
TAC ATC TTT 



5140 5149 \ 5158 

GAG TCA GTT TTC AAA CTT GTG GCC TTT 



5167 
GGC TTC CGC 



M 



V 




4960 
TAC TCG AGA TTC 



4969 
CGG CTC 



5014 
CTC TTC ATC ACT 



5023 
GGT GTC 



5068 

TAC CAG CAG CCC 



5077 
CAG ATC 



Q Q P 
5122 

ACC GTC ATC TTT 



5131 
GTC TTT 



5176 

CGT TTC TTC CAG 



5185 
GAC AGG 



5194 5203 
TGG AAC CAG CTG GAC CTG GCT ATT 



.5221 
CTT CTG TCC 



5230 
ATC ATG GGC ATC 



5239 
ACA CTG 



W N Q L 

5248 5257 5266X 

GAG GAG ATT GAG GTC AAT GCT TCG CTG 



5275 
!CC ATC AAC 



5284 
CCC ACC ATC ATC 



5293 
CGT ATC 



E V N 



5302 5311 5320 

ATG AGG GTG CTC CGC ATT GCT CGA GTT 



. 5329 
AAG CTG 



5338 
TTG AAG ATG GCT 



5347 
GTG GGC 



A R V 



5356 5365 5374 

ATG CGG GCA CTG CTG GAC ACG GTG ATG 



K M A 

5392 
CCC CAG GTG GGG 



V G 

5401 
AAC CTG 



M 



5410 5419 5428 

GGA CTT CTC TTC ATG TTA TTG TTT TTC 



Q V G 

5446 
GCT CTG GGC GTG 



N L 

5455 
GAG CTC 



5464 5473 5482 

TTT GGA GAC CTG GAG TGT. GAT GAG ACA 



G V 

5500 
G GGC TTG GGT 



5509 
CGG CAT 



G L G R H 



5518 5527 5536 

GCC -ACC TTT AGG AAC TTT GGT ATG GCC 



5545 
TTT CTG ACC 



5554 
i TTC CGA GTC 



R V 



5563 
TCC ACT 
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DNASIS DMA Translation [T-INS] 



5572 
GGT GAC AAC TGG 



5581 5590 5599 5608 5617 

GGT ATT ATG AAG GAC ACC CTC CGG GAC TGT GAC CAG GAG 



N W 



5626 

TCC ACC TGC TAC AAC 



5644 5653 5662 5671 

GTC ATC TCC CCT ATC TAC TTT GTG TCC TTC GTG CTG 



S F V L 

5680 5689\ 5698 5707 5716 5725 

ACG GCC CAG TTT GTG CTG \jTC AAC GTG GTC ATA GCT GTG CTG ATG AAG CAC CTG 

M K H L 

5734 5743 \ 5752 5761 5770 5779 

GAA GAA AGC AAC AAA GAG Cck AAG GAG GAG GCC GAG CTC GAG GCC GAG CTG GAG 

E E S N 

5788 5797 \ 5806 5815 5824 5833 

CTG GAG ATG AAG ACG CTC AGC CCG CAG CCC CAC TCC CCG CTG GGC AGC CCC TTC 

L E M K 

5842 5851 5860 5869 5878 5887 

CTC TGG CCC GGG GTG GAG GGT GTC \aC AGT CCT GAC AGC CCT AAG CCT GGG GCT 

L W P G 



V 



M 



N 




5896 5905 591 

CCA CAC ACC ACT GCC CAC ATT GGA 



5923 5932 5941 

TCG GGC TTC TCC CTT GAG CAC CCC 



5950 5959 5968 \ 5977 5986 5995 

ACG ATG GTA CCC CAC CCC GAG GAG GTG CCA. GTC CCC CTA GGA CCA GAC CTG CTG 



M 



6004 6013 6022 

ACT GTG AGG AAG TCT GGT GTC AGC CGG ACG 



6031 6040 6049 

• TCT CTG CCC AAT GAC AGC TAC 



N 



6058 6067 6076 6085 6094 6103 

ATG TGC CGC AAT GGG AGC ACT GCT GAG AGA TCC CTA GGA CAC AGG GGC TGG GGG 



W 



6112 6121 6130 6139. 6148 6157 

CTC CCC AAA GCC CAG TCA GGC TCC ATC TTG TCC GTTXCAC TCC CAA CCA GCA GAC 



6166 6175 6184 6193 \ 6202 6211 

ACC AGC TGC ATC CTA CAG CTT CCC AAA GAT GTG CAC TAT\ CTG CTC CAG CCT CAT 

L Q P H 

6220 6229 6238 ■ 6247 V 6256 6265 

GGG GCC CCC ACC TGG GGC .GCC ATC CCT AAA CTA CCC CCA COT GGC CGC TCC CCT 
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DNASIS 



DNA Translation [T-INS] 



6274 
CTG GCT CAG 



6283 6292 6301 6310 6319 

CCT CTC AGG CGC CAG GCA GCA ATA AGG ACT GAC TCC CTG GAT 



6328 \ 6337 6346 6355 6364 6373 

GTG CAG GGC CTG GST AGC CGG GAA GAC CTG TTG TCA GAG GTG ACT GGG CCC TCC 



V 




V 



6382 6391 6400 6409 6418 6427 

TGC CCT CTG ACC CGG TCC TCA TCC TTC TGG GGC GGG TCG AGC ATC CAG GTG CAG 



W 



V 



6436 6445\ 6454 6453 6472 6481 

CAG CGT TCC GGC ATC CAGVAGC AAA GTC TCC AAG CAC ATC CGC CTG CCA GCC CCT 



6490 6499 \ 6508 6517 6526 6535 

TGC CCA GGC CTG GAA CCC AGO TGG GCC AAG GAC CCT CCA GAG ACC AGA AGC AGC 



6544 5553 
TTA GAG CTG GAC ACG GAG CTG 



6562 6571 6580 6589 

TGG ATT TCA GGA GAC CTC CTT CCC AGC AGC 



6598 6607 6616 6625 6634 6643 

CAG GAA GAA CCC CTG TCC CCA CGG SAC CTG AAG AAG TGC TAC ACT GTA GAG ACC 



6652 6661 6670 1 

CAG AGC TGC AGG CGC AGG CCT GGG TCC 



6868 6877 6886 6895' 

CCA TGC AGT CCT GGT GTC TGC CTC AGG AGG AGG GCG 



V 




6679 6688 6697 

CTA GAT GAA CAG CGG AGA CAC TCC 



6706 6715 6724 \ 6733 6742 6751 

ATT GCT GTC AGC TGT CTG GAC AGC GGC TCQ CAA CCC CGC CTA TGT CCA AGC CCC 



6760 6769 6778 \ 6787 6796 6805 

TCA AGC CTC GGG GGC CAA CCT CTT GGG GGT COp GGG AGC CGG CCT AAG AAA AAA 

P K K K 

6814 6823 6832 6841 6850 6859 

CTC AGC CCA CCC AGT ATC TCT ATA GAC CCC CCG AGC CAG GGC TCT CGG CCC 



6904 6913 
GCC AGT GAC TCT AAG 



6922 6931 6940 6949 \ 6958 6967 

GAT CCC TCG GTC TCC AGC CCC CTT GAC AGC ACG GCT GCC\TCA CCC TCC CCA AAG 



K 
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DNASIS DNA Translation [T-INS] 

6976 V 6985 6994 7003 7012 1021 

AAA GAC ACG CTG A^T CTC TCT GGT TTG TCT TCT GAC CCA ACA GAC ATG GAC CCC 

GLSSDPTDMDP 

9 7048 7057 7066 7075 

CCC CAT CAC CTT TCT CCA CCG GGT GCA GAT CCT AGC 



7030 7 
TGA GTC CTA CCC ACT 



V 



7084 7093 
TCC GCC TCC TGG GCA GCG 




HHLSPPGADPS 

7102 7111 7120 7129 

CTG AAA AGT CCC ACG TAA GCA GCA AGC AGC CAC 



W 




\L K 
7 



7138 7147 \ 7156 7165 7174 7183 

GAG GCA CCT CAC CTG CCT TCT TEA GTG GCT GGT GGG GAT GAC GAG CAG AAC TTC 



H 




N 



7192 7201 7210 7219 7228 7237 

CGG AGA GTC GAT CTG AAG AGA ACA C&G CCC TGG AGC CCC TGC CTC CGG GAA GAA 



V 



K 



7246 7255 7264 

GGA AAA GGA GAA AGC CCA GTG TGG CCA 

GKGESPVW 



WSPCLREE 

7273 7282 / < 

CTC CCG ACA CCA GGA GCT G /3 1 



1 



p g" "a/j^w 2 * H 
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DNASIS 
T-INS 
568 



,00- 



f> 10 
ftGCTGAGC TGAAC! 

70 

CTCCCCCACC GGGGT 

130 \J.40 
CCTCTTCGGA CCCCCCC 

190 2t 
GGGCGCCGAG GAGTCGGGaIc 

250 26d 
CGGGGGCCGG CAGGGGCCGG 

310 320 
GGAGGGGCTG CCGTACCCGG 

370 380 
CCGCCCGCGG AGCTGGTGTC 

430 440 
GCTGGTCATT CTTCTCAACT 

490 500 
CTGTGACTCC CAGCGCTGCC 

550 560 
TGCTGTGGAA ATGGTGGTGA 



30 

CTCCTGGGGA 
90 

GGTTGCGTGA 
150 

CCCCGGCTGG 
210 

AGCCCCGTAG 



40 

CTCAGCAAGC 
100 

GGACACCTCC 



50 

TCTCTAGAGC 
110 

TCTGAGGGGC 



220 

CTTCACGCAG 
280 

AAAGGACCCG 
340 

GGTGGTTTTC 
400 

CTGTAACCCG 
460 

GGGTATGTTC 
520 

GGCCTTCGAT 



170 

GACGAGGAGG 
230 

CTCAACGACC 
290 

GGCAGCGCGG 
350 

TTCTACTTGA 
410. 

TGGTTCGAGC 
470 

AGGCCGTGTG 
530 

GACTTCATCT 
590 

TTTGGGAAGA ' 



610 620 
GGGAGACACT TGGAACCGGC 

670 680 
GCTGGACCTG CAGAACGTCA 

730 740 
CAGGGCCATT AACCGGGTGC 

790 800 
GCCTATGCTG GGCAACGTCC 

850 860 
GGGCGTCCAG CTGTGGGCAG 

910 920 
CCTCCCCCTG AGCGTGGACC 

970 980 
CTTCATCTGC TCTCAGCCTC 

1030 1040 
GCGTGGGGAA GGCGGTGGTG 

1090 1100 
CAGCAACACC ACCTGTGTCA 

1150 1160 
CAACCCCTTC AAAGGCGCCA 

1210 1220 
CCAGGTCATC ACACTGGAGG 




60 

CCCCCACATG 
120 

TCCGCTCGCC 

■ 5^a/-+ *tt 

180 jeo IP *>o: I 

AGGATGGAGC ao J;aj rt.fi** 



750 

CCAGCATGCG 
810 

TGCTGCTCTG 
870 

GACTGCTTCG 
930 

TGGAGCCTTA 
990 

GGGAGAATGG 

1050 
GCCCACCCTG 

1110 
ACTGGAACCA 

1170 
TCAACTTTGA 

1230 
GCTGGGTCGA 



650 

GCAGGGATGC 
710 

GTCCGTGTGC 
770 

ACATTACTGC 
830 

TTTTTCATCT 



890 

TTCCTCCCCG 



10( 
CATGAGATCtf 

1060 
CAGTCTGGAC 

1120 
GTACTATACC 

1180 
CAACATTGGC 

1240 
CATCATGTAC 



240 

TGTCCGGGGC 
300 

ACTCCGAGGC 
360 

GCCAGGACAG 
420 

GAGTCAGTAT 
480 

AGGACATTGC 
540 

TTGCCTTCTT 
600 

"AATGTTACCT 
660 

TGGAGTATTC 
720 

TGCGACCGCT 
780 

TGGACACCTT 
840 

TTGGCATCGT 
900 

AGAACTTCAG 
960 

ACGAGAGCCC 

1020 
TGCCCACACT 

1080 
ATAACAGTTC 

1140 
CGGGCGAGCA 

1200 
TCGCCATCTT 

1260 
ACGCTCACTC 
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DNASIS 






T-INS \ 






1270 


\ 1280 


1290 


CTTCTACAAC 


TTCATCTACT 


TCATTdTCT 


1330 


\ 1340 


1350 


CCTGTGCCTG 


GTGGTGATTG 


CCACGCAGTT 


1390 


\ 1400 


1410 


GATGCGGGAG 


CAGCOTGTAC 


GATTCCTGTC 


1450 


U460 


1470 


GCCAGGCAGC 


TGCTATOAGG 


AGCTACTCAA 


1510 


1520 


1530 


CCGAAGGCTG 


GCCCAGGTOT 


CTAGGGCTAT 


1570 


1580\ 


l 1590 


AGTGGCCCGT 


AGTGGGCAGG 


Vgccccagcc 


1630 


1640 


\ 1650 


TCTGTCTGTC 


CACCACCTGG 


TCfcACCACCA 


1690 


1700 


\ 1710 


TAATGGGACG 


CTCAGAGTTC 


CCCGGGCCAG 


1750 


1760 


\l770 


GTCTCGCCGG 


CTCATGCTAC 


CACCAOCCTC 


1810 


1820 


1830 


GGGTGCGGAG 


TCTGTACACA 


GCTTCTACCA 


1870 


1880 


1890 


CCAGGCACCC 


CCTCCCAGAT 


GCCCATCGGa 


1930 


1940 


1950 


GGTGTACCCC 


ACTGTGCATA 


CCAGCCCTCC 


1990 


2000 


2010 


GGAGGTGGCC 


CCCAGCCCTG 


GGCCCCCCAC 


2050 


2060 


2070 


CTTCAGCTCC 


ATGCACAAGC 


TCCTGGAGAC 


2110 


2120 


2130 


CAAAATCTCC 


AGCCCTTGCT 


CCAAGGCAGA 


2170 


2180 


2190 


CTACTGTGCC 


CGGACAGGAG 


CAGGAGAGCC 


2230 


2240 


2250 


AGACAGCGAG 


GCTGTGTATG 


AGTTCACACA 


2290 


2300 


2310 


CCACAGCCGG 


CGGCGACAGC 


GGAGCCTGGG 


2350 


2360 


2370 


TTTCTGGAGG 


CTGATCTGTG 


ACACATTCCG 


2410 


2420 


2430 


GGGAATCATG 


ATCGCCATCC 


TGGTCAATAC 


2470 


2480 


2490 


GCCCGAGGAG 


CTCACCAACG 


CCCTGGAAAT 



1300 


1310 


1320 


CATCATCGTG 


GGCTCCTTCT 


TCATGATCAA 


1360 


1370 


1380 


CTCCGAGACC 


AAACAGCGGG 


AGAGTCAGCT 


.3-420 


1430 


1440 


ti fry** »in\pn 

CAATGCTAGC 


ALU. rutiCAA 


GCTIXrrCTGA 


1480 


1490 


1500 


GTACCTuCj l\j 


i aca rut, rcc 


GAAAAGCAGC 


1540 


1550 


1560 






TC.AGCAGCC.v- 


1600 


1610 


1620 








1660 


1670 


1680 


TCACQ-AC.I-AL. 


/■* 7\ TV* ta ta 
tAJ\-Av„C-AC_ 1 


AC(-ACt_TGG<j 


1720 


1730 


1740 




^ ta cv ta r* ta sw* 
LA.UviA(_A(A3ur 


ATGCCAATGo 


1780 


1790 


1800 


TACACCCAt. I 




GCCCTCCGAG 


1840 


1850 


1860 




<JAL. i IXjCiACaU 


CAGTCCGTTG 


1900 


1910 


1920 


t3\3\_AH- i. 1 






\ 1960 


1970 


1980 


^CCAGAGAIA 


i- IvjAAtjuAIA 


AAGCACTAGT 


\ 2020 


2030 


2040 


CC'lyACCAGC 


1 ACAACATCC 


CACCTGGGCC 


\ 2080 


2090 


2100 


ACAGAETACG 


GGAGCCTGCC 


ATAGCTCCTG 


2140 


2150 


2160 


C^GTGGA*CC 


TGCGGGCCGG 


ACaGTTGTCC 


22{\0 


2210 


2220 


AGAGTCCGCT 


GACCATGTCA 


TGCCTGACTC 


2260^ 


^ 2270 


2280 


GGACGCTCAG 


CACAGTGACC 


TCCGGGATCC 


2320 


\ 2330 


2340 


CCCAGATGCA 


GAGCCTAGTT 


CTGTGCTGGC 


2380 


\ 2390 


2400 


GAAGATCGTA 


GATAECAAAT 


ACTTTGGCCG 


2440 


\450 


2460 



ACTCAGCATG GGCATCQAGT ACCACGAGCA 

2500 2s\o 2520 

CAGCAACATC GTCTTCACCA GCCTCTTCGC 
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2530 


2540 


2550 


CTTGGAGATa 


CTGCTGAAAC 


TGCTTGTCTA 


2590 


\ 2600 


2610 


CAACATCTCT 


GATGGTGTCA 


TTGTGGTCAT 


2550 


\ 2660 


2670 


AGGTGGCCTG 


TCQGTGCTGC 


GGACCTTCCG 


2710 


\ 2720 


2730 


CCTGCCGGCC 


CTGCHGCGCC 


AGCTCGTGGT 


2770 


2780 


2790 


CTTCTGCATG 


CTCCTOSTGC 


TGTTCATCTT 


2830 


28>40 


2850 


TGGTTGCAAG 


TTCGCATCTG 


AACGGGATGG 


2890 


2900^ 


^ 2910 


CTCCCTGCTC 


TGGGCCATCG 


VcACTGTCTT 


2950 


2960 


\ 2970 


AGTCCTCTAC 


AACGGCATGG 


CCTCCACATC 


3010 


3020 


\ 3030 


CATGACTTTT 


GGCAACTATG 


TGCTOTTTAA 


3070 


3080 


3090 


CCAGGCAGAG 


GAAATCGGCA 


AACGGG^AGA 


3130 


3140 


3 ISO 


GCCTGTCAAC 


TCTCAGGGGG 


GAGATGCCAC 


3190 


3200 


3210* 


GCCCAGTGTG 


GATGGTGATG 


GGGACAGAAA 


3250 


3260 


3270 


ACACGCGGAA 


CTACGAAAGA 


GCCTTTTGCC 


3310 


3320 


3330 


AATGTCACTA 


CCCAAGAGCT 


CCAGCACAGG 


3370 


3380 


3390 


ACGTAGCAGT 


AGCAGTGGGT 


CCGCTGAGCC 


3430 


3440 


3450 


GCCAAGTGCC 


CGCAGCTCCC 


CGCACAGTCC 


3490 


3500 


3510 


G\_GCTCCAGC 


AGGAACAGCC 


TGGGCCGGGC 


3550 


3560 


3570 


GGAGCGGAGG 


TCCCTGCTGT 


CTGGAGAGGG 


3S10 


3620 


3630 


AGAAGAGGAC 


CGGGCCAGCC 


CAGCAGGCAG 


3670 


3680 


3690 


TGAGGCCAAG 


AGTTCCTTTG 


ACCTGCCTGA 


3730 


3740 


3750 


AGCCAGCGGC 


CGGAGCTCTG 


CCTCTGAGCA 



2560 


2570 


2580 


CGGTCCCTJ.T 


GGCTACATTA 


AGAATCCCTA 


2620 


2630 


2640 


CAGTGTGTGG 


GAGATTGTGG 


GCCAGCAGGG 


2680 


2690 


2700 


CCIGATGCGG 


GTGCTGAAGC 


TGGTGCGCTT 


2740 


2750 


2760 


GCTCATGAAG 


ACCATGGACA 


ACGTGGCCAC 


2800 


2810 


2820 


CATCTTCAGC 


ATCCTGGGCA 


TGCATCTCTT 


2860 


2870 


2880 


GGACACGTTG 


CCAGACCGGA 


AGAATTTCGA 


2920 


2930 


2940 


TCAGATTCTG 


ACTCAGGAAG 


ACTGGAATAA 


2980 


2990 


3000 


GTCTTGGGCT 


GCTCTTTACT 


TCATCGCCCT 


3040 


3050 


3060 


CCTGCTGGTG 


gccaitcitc 


TGGAAGGATT 


3100 


3110 


3120 


TGCGAGTGGA 


CAGTTAAGCT 


GTATTCAGCT 


3160 


3170 


3180 


CAAGTCTGAG 


TCAGAGCCTG 


ATTTCTTTTC 


i 3220 


3230 


3240 


GAAGCGCTTG 


GCCCTGGTGG 


CTTTGGGAGA 


\ 3280 


3290 


3300 


ACECCTCATC 


ATCCATACGG 


CTGCGACACC 


\ 3340 


3350 


3360 




GCACTGGGCT 


CTGGCTCTCG 


N3400 


3410 


3420 


TGGAGCTGCC 


CACCATGAGA 


TGAAATCTCC 


341^0 


3470 


3480 


CTGGAGTGOG 


GCAAGCAGCT 


GGACCAGCAG 


3520\ 


t 3530 


3540 


CCCCAGCCTA 


AAGCGGAGGA 


GCCCGAGCGG 


3580 


\ 3590 


3600 


CCAGGAGAGT 


CAEGATGAGG 


AGGAAAGTTC 


3640 


\ 3650 


3660 


TGACCATCGC 


CACAGGGGTT 


CCTTGGAACG 


3700 


Y710 


3720 


CACTCTGCAG 


GTGCCGGGGC 


TGCACCGCAC 


3760 


3770 


3780 


CCAAGACTGT 


AATGGCAAGff 


CGGCTTCAGG 



00210U 



4 



4 



DNASIS 
T-XNS 




3790 


\3800 


3810 


3820 


3830 


3840 


GCGTTTGGCC 


CGCACCETGA 


GGACTGATGA 


CCCCCAACTG 


GATGGGGATG 


ATGACAATGA 


3850 


3860 


3870 


3880 


3890 


3900 


TGAGGGAAAT 


CTGAGCAAAG 


GGGAACGCAT 


ACAAGCCTGG 


GTCAGATCCC 


GGCTTCCTGC 


3910 


3920, 


3930 


3940 


3950 


3960 


CTGTTGCCGA 


GAGCGAGATT 


yCCTGGTCGGC 


CTATATCTTT 


CCTCCTCAGT 


CAAGGTTTCG 


3970 


3980 


\ 3990 


4000 


4010 


4020 


TCTCCTGTGT 


CACCGGATCA 


TCACCCACAA 


GATGTTTGAC 


CATGTGGTCC 


TCGTCATCAT 


4030 


4040 


\ 4050 


4060 


4070 


4080 


CTTCCTCAAC 


TGTATCACCA TCGCTATGGA 


GCGCCCCAAA 


ATTGACCCCC 


ACAGCGCTGA 


4090 


4100 


\ 4110 


4120 


4130 


4140 


GCGCATCTTC 


CTGACCCTCT 


CCAACTACAT 


CTTCACGGCA 


GTCTTTCTAG 


CTGAAATGAC 


4150 


4160 


4170 


4180 


4190 


4200 


AGTGAAGGTG 


GTGGCACTGG 


GCTGGTGiCTT 


TGGGGAGCAG 


GCCTACCTGC 


GCAGCAGCTG 


4210 


4220 


4230 


4240 


4250 


4260 


GAATGTGCTG 


GACGGCTTGC 


tggtgctcaY 


CTCCGTCATC 


GACATCCTGG 


TCTCCATGGT 


4270 


4280 


4290' 


y 4300 


4310 


4320 


CTCCGACAGC 


GGCACCAAGA TCCTTGGCAT 


GCTGAGGGTG 


CTGCGGCTGC 


TGCGGACCCT 


4330 


4340 


4350 


\ 4360 


4370 


4380 


GCGTCCACTC 


AGGGTCATCA 


GCCGGGCCCA 


GGGACTGAAG 


CTGGTGGTAG 


AGACTCTGAT 


4390 


4400 


4410 


\ 4420 


4430 


4440 


GTCATCCCTC 


AAACCCATTG 


GCAACATTGT 


GGTGATTTGC 


TGTGCCTTCT 


TCATCATTTT 


4450 


4460 


4470 


\ 4480 


4490 


4500 


TGGAATTCTC 


GGGGTGCAGC 


TCTTCAAAGG 




GTGTGTCAGG 


GTGAGGACAC 


4510 


4520 


4530 


4540 


4550 


4560 


CAGGAACATC 


ACTAACAAAT 


CCGACTGCGC 


TCAGGCCAGC 


TACCGATGGG 


TCCGGCACAA 


4570 


4580 


4590 


4600 


4510 


4620 


GTACAACTTT 


GACAACCTGG 


GCCAGGCTCT 


GATGTCCCTG 


TTTGTGCTGG 


CCTCCAAGGA 


4630 


4640 


4650 


46601 


4670 


4680 


TGGTTGGGTT 


GACATCATGT 


ATGATGGGCT 


GGATGCTGTG 


\GGTGTGGATC 


AGCAGCCCAT 


4690 


4700 


4710 


4720 


\ 4730 


4740 


CATGAACCAC 


AACCCCTGGA 


TGCTGCTATA 


CTTCATCTCC 


TJCCTCCTCA 


TCGTGGCCTT 


4750 


4760 


4770 


4780 


\ 4790 


4800 


CTTTGTCCTG 


AACATGTTTG 


TGGGCGTGGT 


GGTGGAGAAC 


TTCCATAAGT 


GCAGACAGCA 


4810 


4820 


4830 


4840 


\4850 


4860 


CCAGGAGGAG 


GAGGAGGCGA 


GGCGGCGTGA 


GGAGAAGCGA 


CTACGG&GGC 


TGGAGAAAAA 


4870 


4880 


4890 


4900 


490.0 


4920 


GAGAAGGAAT 


CTAATGTTGG 


ACGATGTAAT 


TGCTTCCGGC 


AGCTCAGCOA 


GCGCTGCGTC 


4930 


4940 


4950 


4960 


4970\ 


4980 


AGAAGCCCAG 


TGCAAGCCCT 


ACTACTCTGA 


CTACTCGAGA 


TTCCGGCTCC 


VtTGTCCACCA 


4990 


5000 


5010 


5020 


5030 


\ 5040 


CCTGTGTACC 


AGCCACTACC 


TGGACCTCTT 


CATCACTGGT 


GTCATCGGGC 


TvEAACGTGGT 
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5050 


V 5060 


5070 


CACTATGGCC 


ATGGAACATT 


ACCAGCAGCC 


5110 


\5120 


5130 


CAATTACATC 


TTTACCGTCA 


TCTTTGTCTT 


5170 


50.80 


5190 


CTTCCGCCGT 


TTCTTCCAGG 


ACAGGTGGAA 


5230 


5240 


5250 


CATCATGGGC 


ATCACACTGG 


AGGAGATTGA 


5290 


5300 


\ 5310 


CATCATCCGT 


ATCATGAGGG 


TGCTCCGCAT 


5350 


5360 


\ 5370 


TGTGGGCATG 


CGGGCACTGC 


TQGACACGGT 


5410 


5420 


\ 5430 


GGGACTTCTC 


TTCATGTTAT 


TG'iTTITCAT 


5470 


5480 


\ 5490 


AGACCTGGAG 


TGTGATGAGA 


CACAOCCTTG 


5530 


5540 


>£550 


GAACTTTGGT 


ATGGCCTTTC 


TGACCC1CTT 


5590 


5600 


5610 


TATTATGAAG 


GACACCCTCC 


gggactgtga 


5650 


5660 


5670 


CTCCCCTATC 


TACTTTGTGT 


CCTTCGTGCT 


5710 


5720 


5730 


CATAGCTGTG 


CTGATGAAGC 


ACCTGGAAGA 


5770 


5780 


£790 


GCTCGAGGCC 


GAGCTGGAGC 


TGGAGATGAA 


5830 


5840 


5850 


GGGCAGCCCC 


TTCCTCTGGC 


CCGGGGTGGA 


5890 


5900 


. 5910 


TGGGGCTCCA 


CACACCACTG 


CCCACATTGG 


5950 


5960 


5970 


CACGATGGTA 


CCCCACCCCG 


AGGAGGTGCC 


6010 


6020 


6030 


GAGGAAGTCT 


GGTGTCAGCC 


GGACGCACTC 


6070 


6080 


6090 


TGGGAGCACT 


GCTGAGAGAT 


CCCTAGGACA 


6130 


6140 


6150 


AGGCTCCATC 


TTGTCCGTTC 


ACTCCCAACC 


6190 


6200 


6210 


CAAAGATGTG 


CACTATCTGC 


TCCAGCCTCA 


3 6250 


6260 


6270 


ACTACCCCCA 


CCTGGCCGCT 


CCCCTCTGGC 




) 



5080 


5090 


5100 


CCAGATCCTG 


GACGAGGCTC 


TGAAGATCTG 


5140 


5150 


5160 


TGAGTCAGTT 


TTCAAACTTG 


TGGCCTTTGG 


5200 


5210 


5220 


CCAGCTGGAC 


CTGGCTATTG 


TGCTTCTGTC 


5260 


5270 


5280 


GGTCAATGCT 


TCGCTGCCCA 


TCAACCCCAC 


5320 


5330 


5340 


TGCTCGAGTT 


CTGAAGCTGT 


TGAAGATGGC 


5380 


5390 


5400 


GATGCAGGCC 


CTGCCCCAGG 


TGGGGAACCT 


5440 


5450 


5460 


CTTTGCAGCT 


CTGGGCGTGG 


AGCTCTTTGG 


5500 


5510 


5520 


TGAGGGCTTG 


GGTCGGCATG 


CCACCTTIAG 


5560 


5570 


5580 


CCGAGTCTCC 


ACTGGTGACA 


ACTGGAATGG 


5620 


5630 


5640 


CCAGGAGTCC 


ACCTGCTACA 


ACACTGTCAT 


5680 


5690 


5700 


GACGGCCCAG 


TTTGTGCTGG 


TCAACGTGGT 


y 5740 


5750 


5760 


Vagcaacaaa 


GAGGCCAAGG 


AGGAGGCCGA 


\ 5800 


5810 


5820 


GACGCTCAGC 


CCGCAGCCCC 


ACTCCCCGCT 


\ 5860 


5870 


5880 


GGCiprCAAC 


AGTCCTGACA 


GCCCTAAGCC 


\ 5920 


5930 


5940 


AGCAGCCTCG 


GGCTTCTCCC 


TTGAGCACCC 


5980 


5990 


6000 


AGTCCCqCEA 


GGACCAGACC 


TGCTGACTGT 


60V0 


6050 


6060 


TCTGCCCAaa? 


GACAGCTACA 


TGTGCCGCAA 


6100\ 


. 6110 


6120 


CAGGGGCTGG 


CGGCTCCCCA 


AAGCCCAGTC 


6160 


\ 6170 


6180 


AGCAGACACC 


AGCTGCATCC 


TACAGCTTCC 


6220 


\ 6230 


6240 


TGGGGCCCCC 


ACCTSGGGGCG 


CCATCCCTAA 


6280 


\ 6290 


6300 


TCAGAGGCCT 


CTCAGECGCC 


AGGCAGCAAT 
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6310 


\ 6320 


6330 


6340 


6350 


6360 


AAGGACTCAC 


tccCtggatg 


TGCAGGGCCT 


GGGTAGCCGG 


GAAGACCTGT 


TGTCAGAGGT 


6370 


\ 6380 


6390 


6400 


6410 


6420 


GAGTGGGCCC 


TCCTGCCCTC 


TGACCCGGTC 


CTCATCCTTC 


TGGGGCGGGT 


CGAGCATCCA 


6430 


V44O 


6450 


6460 


6470 


6480 


GGTGCAGCAG 


CGTTCCOSCA 


TCCAGAGCAA 


AGTCTCCAAG 


CACATCCGCC 


TGCCAGCCCC 


6490 


65.00 


6510 


6520 


6530 


6540 


TTGCCCAGGC 


CTGGAACCCA 


GCTGGGCCAA 


GGACCCTCCA 


GAGACCAGAA 


GCAGCTTAGA 


6550 


6560 


6570 


6580 


6590 


6600 


GCTGGACACG 




GGATTTCAGG 


AGACCTCCTT 


CCCAGCAGCC 


AGGAAGAACC 


6610 


6620 


\ 6630 


6640 


6650 


6660 


CCTGTCCCCA 


CGGGACCTGA 


AGAAGTGCTA 


CAGTGTAGAG 


ACCCAGAGCT 


GCAGGCGCAG 


6670 


6680 


\ 6690 


6700 


6710 


6720 


GCCTGGGTCC 


TGGCTAGATG 


AACAGCGGAG 


ACACTCCATT 


GCTGTCAGCT 


GTCTGGACAG 


6730 


6740 


\ 6750 


6760 


6770 


6780 


CGGCTCCCAA 


CCCCGCCTAT 


GTuCAAGCCC 


CTCAAGCCTC 


GGGGGCCAAC 


CTCTTGGGGG 


6790 


6800 


\ 6810 


6820 


6830 


6840 


TCCTGGGAGC 


CGGCCTAAGA 


AAAAACTCAG 


CCCACCCAGT 


ATCTCTATAG 


ACCCCCCGGA 


6850 


6860 


\ 6870 


6880 


6890 


6900 


GAGCCAGGGC 


TCTCGGCCCC 


CATGCAGTCC 


TGGTGTCTGC 


CTCAGGAGGA 


GGGCGCCGGC 


6910 


6920 


S930 


6940 


6950 


6960 


CAGTGACTCT 


AAGGATCCCT 


CGGTCTCCAG 


CCCCCTTGAC 


AGCACGGCTG 


CCTCACCCTC 


6970 


6980 


6990 


7000 


7010 


7020 


CCCAAAGAAA 


GACACGCTGA' GTCTCTCTGG 


'IITCTCTTCT 


GACCCAACAG 


ACATGGACCC 


7030 


7040 


7050\ 


7060 


7070 


7080 


CTGAGTCCTA 


CCCACTCTCC 


CCCATCACCT 








7090 


7100 


7110 


\ 7120 


7130 


7140 


CTCCTGGGCA 


GCGTTTCTGA AAAGTCCCAC 


GTAAGCAGCA 


AGCAGCCACG 


AGGCACCTCA 


7150 


7160 


7170 


\ 7180 


7190 


7200 


CCTGCCTTCT 


TCAGTGGCTG 


GTGGGGATGA 


CGAGCAGAAC 


TTCCGGAGAG 


TCGATCTGAA 


7210 


7220 


7230 


\7240 


7250 


7260 


GAGAACACAG 


CCCTGGAGCC 


CCTGCCTCCG 


GGAAGHAGGA 


AAAGGAGAAA 


GCCCAGTGTG 


7270 


7280 


7290 


7KI0 


7310 


7320 


GCCAAGGCTC 


CCGACACCAG 
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els*tgippglisasslepptcsptgvprlredtsseglrsplfgppg 
a p a g q r^b deledgagaeesgqprsftqlndlsgaggrqgpgstek 
dpgsadseaeclpypalapvvffylsqdsrprswclrtvcnp wfe 
rvsmlvilln g. v tlgmfrpcediacdsqrcrilqafddfifaffav 
emvvkmval gif gkkcylgdtwnrldffiviagmleysldlqnvs 
fsavrtvrvlrplrainrvpsmrilvtllldtlpmlgnvlllcffv 
ffifgivgvqlwYgllrnrcflpenfslpls vdlep y yqt en e de s 
pfic s qprengmrs crs vptlrg eggggppcs ld y etynsssntt 
cvnwnqyytncs\agehnpfkgainfdnigyawiaifqvitlegwv 
dimyfvmdahsfmfiyfilliivgsffminlclvviatqfsetkqr 
esqlmreqrvrflbnastlasfsepgscyeellkylvyilrkaar 
rlaqvsraigvraqllsspvarsgqepqpsgsctrskrrlsvhhl 
vhhhhhhhhhyh l(s ngtlrvpraspeiqdrdangsrrlmlpppst 
ptpsggpprgaesv hs fyhadchlepvrcqappprcps eas grtv 
gsgkvyptvhtspppbilkdkalvevapspgpptltsfnippgpfss 
mhklletqstgachs\sckisspcskadsgacgpdscpycartgag 
epesadhvmpdsdseavyeftqdaqhsdlrdphsrrrqrslgpda 
epssvlafwrlicdtfmivdskyfgrgimiailvntlsmgieyheq 
peeltnaleisnivfts df alemllkllvygpfgyiknpynifd gvi 
vvisvweivgqqggglsVlrtfrlmrvlklvrflpalqrqlvvlm 
ktmdnvatfcmllmlflvlfsilgmhlfgckfaserdgdtlpdrk 
nfdsllwaivtvfqiltq bd wnkvlyngmastsswaalyfialmt 
fgn yvlfnllv ailvegfqkaeeigkred asgqlsciqlp vn sqggd 
atks esepdffspsvdgd gd rkkrlalvalgehaelrksllp p lii 
htaatpmslpkssstgvgexlgsgsrrtsssgsaepgaahhemks 
ppsarssphspwsaasswtarrssrnslgrapslkrrspsgerrs 
llsgegqesqdeeesseedraspagsdhrhrgslereakssfdlpd 
tlqvpglhrtasgrssaseh qyd cngksasgrlartlrtddpqldg 
DDDNDEGNLSKGERIQAWVR sr lpaccrerdswsayifpp qs r fr 
llchriithkmfdhvvlviifl sc itiamerpkidphsaerifltl s n 
yiftavflaemtvkvvalgwcrgeqaylrsswnvldgllvlisvi 
dilvsmvsdsgtkilgmlrvlrAlrtlrplrvisraqglklvvetl 
msslkpignivviccaffiifgil gyv qlfkgkffvcqgedtrnitnk 
sdcaeasyrwvrhkynfdnlgq a l mslfvlaskdgwvdim yd gl 
davgvdqqpimnhnpwmllyfiskllivaffvlnmfvgvvvenfh 
kcrqhqeeeearrreekrlrrle k\k rrnlmlddviasgssas aas 
eaqckpyysdysrfrllvhhlctshyldlfitgviglnvvtmame 
hyqqpqildealkicnyiftvifvfepfklvafgfrrffqdrwnq 
ldlaivllsimgitleeievnaslpinVtiirimrvlriarvlkllkm 
avgmralldtvmqalpqvgnlgllfymllffifaalgvelfgdlec 
dethpceglgrhatfrnfgmafltl FiR vstgdnwngimkdtlrdc 
dqestcyntvispiyfvsfvltaqfvl' 
keeaeleaelelemfctlspqphs 
aphttahigaasgfslehptmvphpee' 
thslpndsymcrngstaerslghrgwi 
tscilqlpkdvhyllqphgaptwgaip] 
airtdsldvqglgsredllsevsgps 



NVVIAVLMKHLEESNKEA 
'FLWPGVEGVNSPDSPKPG 
PVPLGPDLLTVRKSGVSR 
ILPKAQSGSILSVHS QP AD 
\LPPPGRSPLAQRPLRRQA 
iTRSSSFWGGSSIQ VQQR 



SGIQSKVSKHIRLPAPCPGLEPSWAKDP P\£ TRSSLELDTELS WIS G 
DLLPSSQEEPLSPRDLKKCYSVETQSCR R^R PGSWLDEQRRHS IAV 
SCLDSGSQPRLCPSPSSLGGQPLGGPGSRBKKKLSPPSISIDPPESQ 
GSRPPCSPGVCLRRRAPASDSKDPSVSSPIiDSTAASPSPKKDTLSL 
SGLSSDPTDMDP*VLPTLPHHLSPPGADPSSASWAAFLKSPT*AAS 
S H2071 EAPHLPSSVAGGDDEQNFRRVDLKRTjQPWSPCLREEGKGE 

s p vwprl-ptpga/ . 

5£« % 



SEQ ID NO: 5: SKEKQMA 

SEQ ID NO: 6: 5 ' TNGC (A/C/T) ATGGAG (C/A) GNCC (C/T) - 3 ' 
SEQ ID NO: 7: 5 ' -CTT (C/G/T) CCCTTGAA (G/ C) A (G/ A) CTG) - 3 ' 
SEQ ID NO: 8 : 5 ' -CCGCTGTCGGAGACCATGGAGACC- 3 ' 
SEQ ID NO : 9 : 5 ' -AGCGSCCCAAAATTGACCCCCACAG- 3 ' 
SEQ ID NO: 10: 5 ' - GAAGATGCGAGTGGACAG - 3 1 
SEQ ID NO: 11: 5'- CTGTGGCGATGGTCACTG-3 1 



